Specialized Research Datasets in the CiteSeerX Digital Library
Related resources
CiteSeerX: AI in a Digital Library Search Engine
CiteSeerX is a digital library search engine that provides access to more than 4 million academic documents with nearly a million users and millions of hits per day. Artificial intelligence (AI) technologies are used in many components of CiteSeerX, e.g., to accurately extract metadata, intelligently crawl the web, and ingest documents. We present key AI technologies used in the following compone...
Utility-Based Control Feedback in a Digital Library Search Engine: Cases in CiteSeerX
We describe a utility-based feedback control model and its applications within an open access digital library search engine – CiteSeerX, the new version of CiteSeer. CiteSeerX leverages user-based feedback to correct metadata and reformulate the citation graph. New documents are automatically crawled using a focused crawler for indexing. Those documents that are ingested have their document URL...
Research Questions for the Digital Era Library
The changing information environment and the changing expectations and demands of library users are forcing libraries to reassess their role in the digital age. Amidst this change there is a fundamental constant: the need for access to high-quality research materials. Success in the new environment will require learning much more than we now know about the use of digital resources, their...
Scalability Bottlenecks of the CiteSeerX Digital Library Search Engine
As the document collection and user population increase, the capability and performance of a digital library such as CiteSeerX may be limited by some bottlenecks. This paper describes the current infrastructure of the CiteSeerX academic digital library search engine, outlines its current bottlenecks and proposes feasible solutions. These bottlenecks exist in various components of the system incl...
Journal
Journal title: D-Lib Magazine
Year: 2012
ISSN: 1082-9873
DOI: 10.1045/july2012-bhatia